首页> 外文OA文献 >Segmentation Conditional Random Fields (SCRFs): A New Approach for Protein Fold Recognition
【2h】

Segmentation Conditional Random Fields (SCRFs): A New Approach for Protein Fold Recognition

机译:分割条件随机场(SCRFs):蛋白质折叠识别的新方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Protein fold recognition is an important step towards understanding protein three-dimensional structures and their functions. A conditional graphical model, i.e., segmentation conditional random fields (SCRFs), is proposed as an effective solution to this problem. In contrast to traditional graphical models, such as the hidden Markov model (HMM), SCRFs follow a discriminative approach. Therefore, it is flexible to include any features in the model, such as overlapping or long-range interaction features over the whole sequence. The model also employs a convex optimization function, which results in globally optimal solutions to the model parameters. On the other hand, the segmentation setting in SCRFs makes their graphical structures intuitively similar to the protein 3-D structures and more importantly provides a framework to model the long-range interactions between secondary structures directly. Our model is applied to predict the parallel β -helix fold, an important fold in bacterial pathogenesis and carbohydrate binding/cleavage. The cross-family validation shows that SCRFs not only can score all known β -helices higher than non-β -helices in the Protein Data Bank (PDB), but also accurately locates rungs in known beta-helix proteins. Our method outperforms BetaWrap, a state-of-the-art algorithm for predicting beta-helix folds, and HMMER, a general motif detection algorithm based on HMM, and has the additional advantage of general application to other protein folds. Applying our prediction model to the Uniprot Database, we identify previously unknown potential β -helices.
机译:蛋白质折叠识别是理解蛋白质三维结构及其功能的重要步骤。提出了一种条件图形模型,即分段条件随机字段(SCRF),作为对该问题的有效解决方案。与传统的图形模型(例如隐马尔可夫模型(HMM))相反,SCRF遵循判别方法。因此,可以灵活地在模型中包括任何特征,例如整个序列上的重叠或远程交互特征。该模型还采用了凸优化函数,这导致了模型参数的全局最优解。另一方面,SCRF中的分割设置使它们的图形结构直观上类似于蛋白质3-D结构,更重要的是提供了一个框架,可以直接对二级结构之间的远程相互作用进行建模。我们的模型用于预测平行的β螺旋折叠,这是细菌发病机理和碳水化合物结合/裂解的重要折叠。跨族验证显示,SCRF不仅可以对蛋白质数据库(PDB)中所有已知的β-螺旋得分都高于非β-螺旋,而且可以准确地定位已知β-螺旋蛋白中的梯级。我们的方法优于用于预测β-螺旋折叠的最新算法BetaWrap和基于HMM的常规基序检测算法HMMER,并且具有普遍应用于其他蛋白质折叠的优势。将我们的预测模型应用于Uniprot数据库,我们可以确定以前未知的潜在β-螺旋。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号